Multi-Observation Regression
Authors
Abstract
Recent work introduced loss functions that measure the error of a prediction based on multiple simultaneous observations or outcomes. In this paper, we explore the theoretical and practical questions that arise when using such multi-observation losses for regression on data sets of (x, y) pairs. When a loss depends on only one observation, the average empirical loss decomposes by applying the loss to each pair, but in the multi-observation case, empirical loss is not even well-defined, and the possibility of statistical guarantees is unclear without several (x, y) pairs with exactly the same x value. We propose four algorithms formalizing the concept of empirical risk minimization for this problem, two of which have statistical guarantees in settings allowing both slow and fast convergence rates, but which are outperformed empirically by the other two. Empirical results demonstrate the practicality of these algorithms in low-dimensional settings, while lower bounds demonstrate intrinsic difficulty in higher dimensions. Finally, we demonstrate the potential benefit of the algorithms over natural baselines that use traditional single-observation losses via both lower bounds and simulations.
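The contrast drawn in the abstract between single-observation and multi-observation empirical loss can be illustrated with a minimal sketch. It is not one of the paper's four algorithms. It assumes, purely for illustration, a two-observation loss (r - (y1 - y2)^2 / 2)^2, whose expected value over two independent draws of y is minimized at r = Var(y). All function names are hypothetical. The naive grouped estimator shown here only works when some x value repeats exactly, which is the difficulty the abstract describes.

```python
from collections import defaultdict


def squared_loss(pred, y):
    """Single-observation loss: needs one outcome per prediction."""
    return (pred - y) ** 2


def two_obs_variance_loss(pred, y1, y2):
    """Illustrative two-observation loss (an assumption of this sketch).

    For independent draws y1, y2 from the same distribution, its expectation
    is minimized at pred = Var(y), since E[(y1 - y2)^2 / 2] = Var(y).
    """
    return (pred - 0.5 * (y1 - y2) ** 2) ** 2


def single_obs_empirical_risk(f, data):
    """Average loss decomposes over the (x, y) pairs one at a time."""
    return sum(squared_loss(f(x), y) for x, y in data) / len(data)


def grouped_two_obs_empirical_risk(f, data):
    """Naive multi-observation risk: pair up outcomes sharing the same x.

    Returns None when no x value repeats, because no pair of simultaneous
    observations is then available and the quantity is undefined.
    """
    outcomes_by_x = defaultdict(list)
    for x, y in data:
        outcomes_by_x[x].append(y)

    losses = []
    for x, ys in outcomes_by_x.items():
        # Use disjoint consecutive pairs (ys[0], ys[1]), (ys[2], ys[3]), ...
        for i in range(0, len(ys) - 1, 2):
            losses.append(two_obs_variance_loss(f(x), ys[i], ys[i + 1]))

    if not losses:
        return None
    return sum(losses) / len(losses)


def constant_two(x):
    """Hypothetical predictor that always outputs 2.0."""
    return 2.0


repeated_x = [(0.0, 1.0), (0.0, 3.0)]   # same x observed twice
distinct_x = [(0.0, 1.0), (1.0, 3.0)]   # every x observed once

risk_single = single_obs_empirical_risk(constant_two, distinct_x)
risk_grouped = grouped_two_obs_empirical_risk(constant_two, repeated_x)
risk_undefined = grouped_two_obs_empirical_risk(constant_two, distinct_x)
```

With real-valued features, exact repeats of x are rare, so the grouped estimator typically returns None, while the single-observation risk is always computable.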
Similar resources
Regression Analysis under Inverse Gaussian Model: Repeated Observation Case
Traditional regression analyses assume normality of observations and independence of mean and variance. However, there are many examples in science and technology where the observations come from a skewed distribution and, moreover, there is a functional dependence between variance and mean. In this article, we propose a method for regression analysis under Inverse Gaussian model when th...
Comparative analysis of different uni- and multi-variate methods for estimation of vegetation water content using hyper-spectral measurements
Assessment of vegetation water content is critical for monitoring vegetation condition, detecting plant water stress, assessing the risk of forest fires and evaluating water status for irrigation. The main objective of this study was to investigate the performance of various mono- and multi-variate statistical methods for estimating vegetation water content (VWC) from hyper-spectral data. Hyper-s...
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
Numerous deep learning applications benefit from multi-task learning with multiple regression and classification objectives. In this paper we make the observation that the performance of such systems is strongly dependent on the relative weighting between each task’s loss. Tuning these weights by hand is a difficult and expensive process, making multi-task learning prohibitive in practice. We p...
The Comparison of Multi-variable Linear Regression and Artificial Neutral Networks in Tax Evasion of Legal Persons in Iranian Tax System
Tax evasion is one of the most important problems of the tax system in most countries around the world. It covers any unlawful attempt to avoid paying taxes. In the present study, the factors affecting tax evasion were extracted from experts' views using the Delphi method; we identified 29 factors, and finally 16 of them were extracted based on measurement ability. The statistica...
MORD: Multi-class Classifier for Ordinal Regression
We show that classification rules used in ordinal regression are equivalent to a certain class of linear multi-class classifiers. This observation not only allows one to design new learning algorithms for ordinal regression using existing methods for multi-class classification, but also allows one to derive new models for ordinal regression. For example, one can convert learning of ordinal classifier...
Journal title:
- CoRR
Volume abs/1802.09680, issue
Pages -
Publication date: 2018